Appeared in Proceedings of the LREC 2002 - Third International Conference on Language Resources and Evaluation, Las Palmas de Gran Canaria, Spain, 29-31 May 2002, vol. VI, p. 2000-2007

AR-Engine – a framework for unrestricted co-reference resolution

Dan Cristea*, Oana-Diana Postolache*, Gabriela-Eugenia Dima■, Cătălina Barbu□

*University "Al. I. Cuza" of Iaşi, Faculty of Computer Science, 16, Berthelot St., 6600 Iaşi, Romania
{dcristea, oanap}@infoiasi.ro
■University "Al. I. Cuza" of Iaşi, Faculty of Letters, 11, Carol I Bvd., 6600 Iaşi, Romania
g.dima@mail.uaic.ro
□University of Wolverhampton, School of Humanities, Languages and Social Sciences, Stafford St., WV1 1SB, Wolverhampton, United Kingdom
C.Barbu@wlv.ac.uk

Abstract

The paper presents a framework that allows the design, realisation and validation of different anaphora resolution models on real texts. The type of processing implemented by the engine is an incremental one, simulating the reading of texts by humans. Advanced behaviour like postponed resolution and accumulation of values for features of the discourse entities during reading is implemented. Four models are defined, plugged in the framework and tested on a small corpus. The approach is open to any type of anaphora resolution. However, the models reported deal only with co-reference anaphora, independent of the type of the anaphor. It is shown that turning on more and more features generally results in an improvement of the analysis.

1. Introduction

It is well known that an algorithm of anaphora resolution (AR) with a very high degree of success has not been found yet. In (Cristea and Dima, 2000) a framework able to easily accommodate different approaches in anaphora resolution was proposed. The central notion in the framework is that of anaphora resolution model. In this paper we describe a group of experiments with the
framework, used as a workbench, and show an ascending series of precision and recall results obtained by models that have more and more features turned on. The experiments report results on co-reference resolution that involves any type of noun phrases as anaphors. The work described here relies heavily on annotated language resources.

In section 2 we describe a framework that allows easy design, implementation and evaluation of any AR model. Section 3 presents the small corpus we used for our research and the experiments pursued. Section 4 illustrates a series of four models that were used for the experiments and section 5 reports the results.

2. A framework for anaphora resolution

In (Cristea and Dima, 2000) the anaphora resolution process is interpreted as involving three layers: the text layer – populated with referential expressions (REs) –, the intermediate layer – where feature structures (FSs) are filled-in with information from the text –, and the deep semantic layer – where descriptions of discourse entities (DEs) are placed. We say that an FS is projected from an RE and that a DE is evoked by an FS (Figure 1).

The type of analysis supported by the framework is incremental; therefore the order of processing the anaphors simulates the human reading. Just like in normal reading, anaphors are mostly resolved at the time of reading, but sometimes decisions are postponed until the acquisition of complementary information that helps the disambiguation process. It is like when backwards eye movements reveal indecisions (Vonk, 1985).

[Figure 1: Three-layer representation of co-referring referential expressions. Text layer: REa, REb; projection layer: FSa, FSb; semantic layer: DEa. Relations shown: projects, proposes, evokes.]

2.1. Definition of an AR model

In the definition of an AR approach the notion of AR model is basic. In (Cristea and Dima, 2001) such a model is described as having four components:
- a set of primary attributes that fill the projection layer and are then reported to the semantic layer;
- a set of knowledge sources or virtual processors fetching values for the attributes during the incremental text processing;
- a set of heuristics or rules intended to co-operate in order to decide if an RE introduces a new discourse entity and, if not, which existing DEs it co-refers with;
- a set of rules that configure the domain of referential accessibility, therefore establishing the order in which DEs have to be checked.

In order for the framework to accommodate an AR approach, a specific model has to be plugged in the framework. The framework, at least in principle, allows for unrestricted AR and only the plugged-in model dictates whether the resolution involves only co-references or any type of anaphora and whether it takes into consideration only pronouns or any type of referential expressions.

An analysis of the existent approaches (Mitkov, 2002) helps classify the attributes into the following categories:

a. morphological:
- lexical number;
- lexical gender;
- person.
All the approaches use morphological criteria to filter out antecedents. However, the elimination of possible referential candidates based on mismatches in morphological features may lead to errors as shown at least by Barlow (1998). Morphology cannot be ignored, but a less categorical approach is preferable.

b. syntactical:
- full syntactic description of REs as constituents of a syntactic tree (Lappin and Leass, 1994);
- marking of the syntactic role for subject position or obliqueness (the subcategorisation function in respect to the verb) of the REs: syntactic domain-based approaches (Chomsky, 1981; Reinhart, 1981; Gordon and Hendrick, 1998; Kennedy and Boguraev, 1996) and all Centering-based approaches (Grosz, Joshi and Weinstein, 1995; Brennan, Friedman and Pollard, 1987);
- quality of being adjunct, embedded or complement of a preposition (Kennedy and Boguraev, 1996);
- inclusion or not in an existential construction (Kennedy and Boguraev, 1996);
- syntactic patterns in which the RE is involved, that can lead to the determination of syntactic parallelism (Kennedy and Boguraev, 1996; Mitkov, 1997).

c. semantic:
- position of the head of the RE in a conceptual hierarchy (hypo/hypernymy): all models using Wordnet (Poesio, Vieira and Teufel, 1997). Features such as animacy, sex (or natural gender) and concreteness could be considered simplified semantic tags derived from a conceptual hierarchy;
- inclusion in a synonymy class that is determined by the context;
- semantic roles, out of which selectional restrictions, inferential links, pragmatic limitations, semantic parallelism and object preference can be verified.

d. positional:
- offset of the first token of the RE (a noun phrase – NP) in the text (Kennedy and Boguraev, 1996);
- inclusion in an utterance, sentence or clause, considered as a discourse unit (Azzam, Humphreys and Gaizauskas, 1998). The setting of this feature allows the computation of the proximity between the anaphor and the antecedent. If both the anaphor and the antecedent are placed in the same unit then c-command criteria can be applied; if they are placed in different units, then the number of units in between could be used as a criterion in limiting the domain of referential accessibility (see Component 4 below).

e. surface realisation (type):
- zero-pronoun (also called zero-anaphora or non-text string), clitic pronoun, full pronoun, reflexive pronoun, possessive pronoun, demonstrative pronoun, reciprocal pronoun, expletive "it", bare noun (undetermined NP), indefinite NP, definite NP, proper noun (name).

f. others:
- inclusion or not of the RE in a specific lexical field, dominant in the text (called domain concept in Mitkov, 1997);
- frequency of the term in the text (Mitkov, 1997);
- occurrence of the term in a heading (Mitkov, 1997).

Component 2 includes a set of knowledge sources fetching values for the attributes of the feature structures belonging to the projection layer. What we understand by a knowledge source is a virtual processor able to fill in values for one single attribute on the projection layer, for instance lexical number, or lexical gender, or part of speech, or syntactic role. Practically, current processors simultaneously fetch values for more than one such attribute. Thus, a morpho-syntactic tagger represents several knowledge sources as it fills in values for more than one attribute of the head word of the RE (Tufis, 2000).

At least two knowledge sources are fundamental to all possible models of anaphora resolution: a part of speech tagger, to associate tags to every token of the text, and an NP extractor, capable of recognising REs by grouping text tokens into NPs. These should be structured as head-modifier compounds. As there are NPs which cannot be REs (e.g. the bucket within the verbal phrase to kick the bucket), a parser should reject such noun phrases, by running a set of regular expressions to discover phrasal units.

Besides the two processors above, currently existing systems use additional knowledge sources. For instance, Kennedy and Boguraev (1996) introduce a marker of syntactic function and a set of patterns which recognise the expletive "it" (near specific sets of verbs or as subject of adjectives with clausal complements). Azzam, Humphreys and Gaizauskas (1998) use a syntactic analyser, a semantic analyser, and an elementary event finder. Gordon and Hendrick (1998) employ a surface realisation identifier and a syntactic parser, while Hobbs (1978) requires, for his semantic approach, a syntactic analyser, a surface realisation identifier and a set of axioms to determine semantic roles and relations of lexical items.

Component 3 contains a set of rules and heuristics intended to answer one or both of the following two questions involving the current FS: (1) Does it introduce a new discourse entity? (2) Which one (ones) of the existing DEs does it refer to and what is the referential relation between this FS and the respective DE (DEs), considered as antecedent(s)? The rules/heuristics perform the evoking phase between an existent FS belonging to the projection layer and DEs belonging to the semantic layer. The mention of more than one DE in the second point above is due to the observation that there are cases when an FS (and, therefore, its corresponding RE) could be in more than just one referential relation to already identified DEs. In the sequence John … his house, for instance, there are three REs – John, his house, and his – and his is in a co-referential relation with the entity [John] while being in an ownership relation with [his house].

In accordance with most authors, we accomplish this process by three types of rules:
- certifying rules (applied first), which, if evaluated to true on a pair (FS, DE), certify without ambiguity the DE as an antecedent of the FS;
- demolishing rules (applied after the certifying rules), which rule out a possible DE as candidate of an FS (and its corresponding RE). These rules lead to a filtering phase that eliminates from among the candidates those discourse entities that cannot possibly be referred to by the RE under investigation;
- promoting/demoting rules (applied after the demolishing rules), which increase/decrease a resolution score associated with a pair (FS, DE). These rules allow for a subsequent selection phase, in which either the best candidate of an FS is chosen from the ones remaining after the demolishing rules have been applied, or a new entity is introduced.

To refer again to the known approaches, in order to rule out possible candidates, Kennedy and Boguraev (1996), for instance, implement conditions that prevent a pronoun from co-referring with a constituent (NP) which contains it. Thus, in the child of his brother, his is neither child, nor brother, but a different entity. For the remaining candidates, they compute the salience by weighing a set of attribute-value pairs. The weights are linguistically and experimentally justified (Keenan and Comrie, 1977; Lappin and Leass, 1994). Gordon and Hendrick (1997) show that the antecedent's syntactic prominence (a notion related to the relative distance in a syntactic tree) influences the selection of the co-referential candidate. In (Gordon and Hendrick, 1998) the salience of the relations between names and pronouns is calculated by using a gradation of surface realisation pairs: name-pronoun > name-name > pronoun-name. Therefore, if the surface realisation of the anaphor is a name, then a candidate whose surface realisation is a name will weigh higher than one whose surface realisation is a pronoun.

Apart from these types of rules, the builder of the model can also choose to implement heuristics intended to deal with still unresolved (postponed) candidates, in the preceding FSs. The idea is that, at a certain moment during processing, the resolution of new anaphors could bring new proofs for the disambiguation of old, unresolved ones. The processing moments for reconsidering postponed FS resolution, once a new link has been added, vary from the processing moment of the immediately following RE in the text, up to the processing moment of the last RE belonging to the following discourse unit. We haven't found cases when this reconsideration should be done later than this moment. At the same time, it is not clear yet whether an untying mechanism should be incorporated into the framework in order to reconsider the anaphoric "garden paths".

Component 4 contains a procedure that configures the domain of referential accessibility. This component has two tasks: to establish the semantic DEs that are open to be referred to by a certain RE (forming the domain of referential accessibility – DRA) and to establish their order according to some prominence criteria.

To exemplify the first task, both the attentional state theory (Grosz and Sidner, 1986) and the veins theory (Cristea, Ide and Romary, 1998) discuss accessibility constraints that rely on the discourse structure. By applying these constraints to the position of a certain RE, some DEs from among those already found as resolution candidates could be hidden. In the attentional-based approaches, the accessibility in the current discourse unit is given by the top-down order of states in a focus stack. In the veins theory, the DRA of a unit is an initial part of a sequence of discourse units, called vein, computed from the discourse structure, considered a nucleus/satellite tagged binary tree.

For the second task, different strategies could be imagined. The recency order is extensively used, placing all the discourse entities referred from the REs in the DRA in the most-recent-first order and eliminating duplicates (if different REs refer to the same DE, only the most recent reference is kept, in the corresponding place of the list). Following this order, among the set of discourse entities with respect to which the current anaphor displays the same resolution score, the one closest to it is selected. For instance, a combination of a "no restriction" DRA policy with the recency order gives the well-known linear order. Alternatively, focus-based approaches (Sidner, 1981; Azzam, Humphreys and Gaizauskas, 1998) use registers for current focus, and alternate foci list, which are updated after each sentence, and, based on them, define an order in which to look for the antecedent. In their discourse prominence representation approach, Gordon and Hendrick (1998) speak about "ordering of entities in the discourse order that determines the accessibility of those entities as referents for subsequent expressions". In VT the E-DRA of a discourse unit is the hierarchical DRA taken from the unit's vein plus the extra units, taken as the least prominent. The order of DEs as possible antecedents of a given RE used in VT-like approaches is the recency order of DEs corresponding to REs belonging to this list (therefore a combination of hierarchical E-DRA with the recency ordering).

2.2. Processing in the framework

The framework processes the text sequentially. When processing begins, the text layer contains the original text, with the sequence of REs, there is no FS built on the projection layer and the semantic layer is empty of DEs. As the projection layer is only meant to support REs under resolution, no structure is maintained on this layer for the resolved REs. On the contrary, the semantic layer maintains structures corresponding to all already found DEs and is continuously updated with the information contributed by the processed co-referential FSs. At the same time, these DEs define equivalence classes among REs of the text layer, as all co-referential expressions refer to the same DE. There is one link that is established between each RE and a specific DE, as in (Kennedy and Boguraev, 1996), and one DE points back to all REs in its class. As already mentioned, one DE accumulates all the relevant properties of its REs, for instance: natural gender and number, all lemmas and proper names that were used to refer to that entity. This is why the evoking rules usually match gender and number by verifying inclusion of the FS's values in the corresponding attribute values of the DE, instead of performing an equality check.

The moment an REx is read, or shortly afterwards (to enable the existent knowledge sources to acquire enough information), a representation on the projection layer is built – the FSx. The processing moment of an RE may be delayed in case some knowledge sources require more text than the span of tokens included in the RE itself or in the immediately preceding context. For instance, some semantic features of REs are identifiable only after the processing of the verb they relate to, while zero pronouns can be identified only after the identification of the verb. When all the declared knowledge sources in the second component of the model have contributed their information, the recently built FSx on the projection layer has a filled-in list of attribute-value pairs: ai = vi, for all attributes listed by the first component of the model.

Suppose DRA=(DE1, … DEm) is the ordered list of accessible discourse entities at the current moment, as given by the fourth component of the model, with DE1 – the most prominent. Let the three sets of rules be denoted as follows: certifying rules – CR, demolishing rules – DR and scored rules – SR, and let the heuristics be denoted by H. Among the attributes of the FSx, a candidates attribute is a vector of pairs, where in each pair the first field points to a corresponding DE, and the second field is the resolution score of the current anaphor REx with regard to the corresponding DE as a possible antecedent. Let's denote the two slots of the elements in this list as idx and score, respectively. Then, the evoking phase of a co-reference type resolution corresponding to FSx and a given DRA list runs as follows:

procedure evoke(FSx, DRA)
 1  initialise the candidates list of FSx with one element for each
    discourse entity DEj in DRA as follows:
      candidates.idx := a pointer to a DEj, in the order given in DRA;
      candidates.score := 0;
 2  for each rule r∈CR and each DEj∈DRA do{
      if r(FSx,DEj) = true then {
        assign DEj as the antecedent of REx;
        go to 9;}}
 3  for each rule r∈DR and each DEj do{
      if r(FSx,DEj) = true then
        eliminate DEj from the candidates list of FSx;}
 4  for each rule r∈SR and each remaining DEj in the candidates list
    of FSx do{
      candidates.score := $\frac{\sum_{r \in SR} w_r \times s_r}{\sum_{r \in SR} w_r}$
      where w_r is the weight of rule r and s_r is the matching score
      of r applied to the pair (FSx,DEj);}
 5  sort the candidates list in the descending order of the score
    values and then of the idx values;
 6  if candidates.score(0) […]
    […] keep in the candidates list of FSx only the entries
      corresponding to this range of scores and delete the rest;
      return false;}
    else
 8  choose as antecedent of REx the DEj given by candidates.idx(0),
    i.e. the first ranked candidate after sorting;
 9  replace DEj with the merge between FSx and the previous content
    of DEj;
10  delete FSx from the projection layer;
11  for each FSy remained on the projection layer do{
      if evoke(FSy, DRA)=false, where now DRA equals the corresponding
      candidates.idx list, then apply h(FSy, DRA), where h∈H;}
12  return true;

Steps 2, 3 and 4 describe actions of certifying rules, demolishing rules and scored rules, respectively. If a certifying rule fires then the antecedent is found and the procedure continues with step 9. If a demolishing rule fires, the targeted DE is eliminated from the candidates. The remaining candidates are then sorted at step 5 in the descending order of the resolution scores computed by the scored rules. Step 6 describes the actions to be taken when a new discourse entity is proposed, due to poor matching of the projected features with any of the already existent DEs. Sometimes, two or even more structures could be maintained on the projection layer, as revealed at step 7. This happens in the case of postponed resolution, which is triggered by a too small ranking difference between the best-ranked candidates. The threshold values used in steps 6 and 7 (threshold_min and threshold_diff) are included in the model among other fine tuning parameters. If just one DE neatly differentiates from the others of the DRA, then at step 8 it is taken as the antecedent. In all cases when an antecedent is found (at step 2 – as a result of a certifying rule or at step 8 – by finding a well individualised candidate among others), the current FS injects information into the existent DE structure – at step 9. This is the way in which we allow the DEs to evolve along with the unfolding of the discourse. Then the current FS, successfully resolved, may disappear from the projection layer – at step 10. Finally, at step 11, which is reached just in case the current FS is resolved, the remaining unsolved FSs on the projection layer are again considered. There are two ways in which unsolved FSs can benefit from FSs resolved at a later point in the discourse. One is by running again the set of rules for just those candidates kept in its list. Since resolved FSs merge their list of attribute-values with the one of the assigned DE, it is possible that these new pieces of information injected into DEs give the illuminating clue to help the disambiguation in a case of indecision. For instance, in the sequence: […]

[…] characters from this short fragment (Winston Smith, coded as [Winston], the girl with black hair – [the girl], the sand-like haired woman – [the woman], O'Brien – […] variations, that we believed should have given an increasing degree of resolution, as they used more and more features turned on, therefore involving more and more computational power. For each co-referential anaphoric chain and for each model, we've computed the […]

Footnote 1: Four paragraphs from part I, chapter I (924 words).

4. The models used in the experiment

The first model (Base-model) is described by the following quadruple:
- set of attributes: POS (part of speech), LNUM (lexical number), LGEN (lexical gender), LEM (lexical lemma), NAME (proper name – if it is the case);
- set of knowledge sources: a POS-tagger (contributing values to POS, LNUM, LGEN) and a lemmatiser (contributing values to LEM and NAME);
- set of matching rules:
  o certifying rules (in what follows fs and de are an FS and a DE structure that are matched by the rules): […]

[…] in a hypernymic chain of a synset of the target word contributes with a SHE vote. Any occurrence of the Wordnet concept […] in a hypernymic chain of a synset of the target word contributes with a HE vote. All the hypernymic chains of the target word that did not match either the first criterion or the second, but meet an occurrence of the Wordnet concept […], contribute with a SHE half-a-vote and a HE half-a-vote, and finally all hypernymic chains of the target word that did not match any of the above criteria contribute with an IT vote. Then all scores are normalized in the range 0-1 by dividing them by the total number of votes. A knowledge source fetching values for natural number was not yet implemented in this model;
- set of matching rules:
  o certifying rules: match-NAME – same as in the base model;
  o scored rules: match-LNUM, match-LGEN, match-LEM – same as in the base model, then:
    * match-NGEN(fs, de): true if that attribute in the set SHE, HE, IT in fs that has the maximum value corresponds to that with the maximum value in DE, otherwise false;
    * match-SYNO(fs, de): true if among the synsets accumulated in the SYNO attribute of the fs there is one that belongs also to the set of synsets corresponding to the DE attribute SYNO;
    * match-HYPER(fs, de): true if a synset from SYNO belongs to the SYNO attribute of DE or vice-versa;
- domain of referential accessibility: linear, as in the base model.

Model 3 (Centering-model): supplementary to the preceding model, this model applies Centering Theory (Grosz, Joshi, Weinstein, 1995; Brennan, Friedman, Pollard, 1987) in order to disambiguate the postponed references, i.e. those particular anaphor-antecedent pairs are chosen that result in the smoothest transition between adjacent utterances. The Centering-model is composed of:
- sets of attributes: same as in the WN-model plus an attribute SEG that gives the ID of the segment to which the current RE belongs and SYN – keeping its syntactic role as a constituent near the main verb;
- knowledge sources: same as in the WN-model plus FDG (a functional dependency grammar parser capable of fetching syntactic roles);
- set of matching rules and heuristics: same as in the WN-model plus two heuristics:
  o CT-preference(fs,DRA): among the DEs in DRA (postponed candidates list of fs) prefers the one that is syntactically best placed (subject>direct-object>indirect-object>other) in the preceding unit. This heuristic is activated only when two or more members of the candidate list of the current FS are referred to by REs belonging to the preceding discourse unit;
  o C-command(fs, DRA): this heuristic prevents two REs belonging to the same discourse unit and having the subject and direct object positions from sharing the same referent;
- domain of referential accessibility: linear recency.
Placing Centering preference as a heuristic implements a "default" vision on its applicability: it works only if the other constraints do not result in determining a certain candidate.

Model 4 (VT-model): supplementary to the preceding model, this model applies Veins Theory (Cristea, Ide and Romary, 1998) in order to determine a better domain of referential accessibility.
- sets of attributes, knowledge sources and set of matching rules and heuristics – same as in the Centering-model;
- domain of referential accessibility: the combination of hierarchical E-DRA of VT with the recency ordering.

5. Results and conclusions

The easiness to configure AR models within the presented framework allowed experiments intended to verify our initial hypothesis: models 1 to 4, adding more and more features and knowledge sources and being richer and richer in matching rules, as well as in the criteria for defining the domains where to look for antecedents, do they lead to increasing factors of precision and recall? When this prediction does not match the reality, what are the reasons that led to this discrepancy? What conclusions can be drawn? Could the models be fine-tuned to repair the failures?

The most important conclusion seems to be the possibility to combine different knowledge sources, from different approaches, for which to design adequate matching rules and heuristics, therefore to take what is best of all studied models to the benefit of a better AR behaviour. Except for VT related sources of model 4, that need discourse structure annotation and which couldn't be provided automatically yet, all sources involved in the experiments were automatic, although manually revised.

To a great extent, the results proved the initial hypothesis, namely models behaved better and better as more features were fired. The best results proved 100% precision and values of recall in the range 70% to 100%. Nevertheless, these figures should be taken with care, because of the small dimension of the corpus we could use. So, the lengths of the reference tracks for the five characters in the golden corpus were as follows: [Winston] – 23, [the girl] – 14, [the woman] – 3, [O'Brien] – 25 and [Goldstein] – 16.

               Model 1      Model 2      Model 3
               P     R      P     R      P     R
[Winston]      100   92     100   92     100   100
[the girl]     100   70     100   90     100   90
[the woman]    100   33     100   66     100   66
[O'Brien]      81    72     81    72     90    72
[Goldstein]    100   50     100   50     100   50

Table 1: Precision and recall of models 1 to 3 applied to the five characters tracked

Table 1 displays only the results of the first three models because model 4 did not show any difference from model 3. These results are commented below.

Model 1 resolves all pronouns-against-pronouns references and pronouns-against-proper nouns references (he against a DE filled-in with only the information contributed by another he pronoun, or he against a Winston contributed DE). The smaller recall figures for [the girl] are due to failure to solve she against a DE contributed by the common noun the girl, as the Base-model does not have the knowledge that [the girl] has a feminine natural gender. As seen by the precision figure less than 100%, some of the references the model predicts for [O'Brien] are erroneous. They will be corrected in model 3 only when Centering will be applied. Of the three references for [the woman] two fail. One is due to a who pronoun, as the model is unprepared yet to treat who and whom. More interesting is the reference woman to a DE contributed by the noun phrase the little sandy-haired woman. This fails too, although a lemma rule is implemented that checks the lemma equality of the head words, because its weight is kept small to prevent, for instance, all rooms that are referred to in a story from being merged into the same equivalence class. The small recall for [Goldstein] in all models is explained because he is referred to as the Enemy, the renegade, backslider, one of the leading figures of the Party, the primal traitor, the earliest defiler of the Party's purity, etc.
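As a rough illustration of how per-chain figures such as those in Table 1 can be read, the sketch below computes precision and recall for a single reference chain. The definitions used here (precision as correct references over proposed references, recall as correct references over gold references), the function name `chain_precision_recall` and the RE labels are assumptions made for this example; they are not taken from the authors' evaluation code. With three gold references for [the woman], of which one is recovered and none is wrongly attached, the sketch gives the 100% precision and roughly 33% recall reported for Model 1.

```python
def chain_precision_recall(proposed, gold):
    """Return (precision, recall) for the chain of one discourse entity.

    proposed: RE identifiers the model attached to the entity
    gold:     RE identifiers attached to it in the annotated corpus
    Both definitions are assumptions made for this illustration.
    """
    proposed, gold = set(proposed), set(gold)
    correct = proposed & gold
    precision = len(correct) / len(proposed) if proposed else 0.0
    recall = len(correct) / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical labels for the three gold references to [the woman].
gold_woman = {"re_a", "re_b", "re_c"}
# Assumed Model 1 output: one reference found, none wrongly attached.
model1_woman = {"re_a"}

p, r = chain_precision_recall(model1_woman, gold_woman)
print(f"P = {p:.0%}, R = {r:.0%}")  # prints: P = 100%, R = 33%
```

Under the same assumptions, recovering two of the three references would give a recall of 2/3, which corresponds to the 66 reported for Models 2 and 3 if the percentage is truncated rather than rounded.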
In Model 2 the recall figure for [the girl] is improved because now Wordnet knowledge allows linking she to a DE contributed by the girl as well as a girl against she as in She was a bold-looking girl. This is possible because in 5 cases out of 6 the word girl is part of a synset which has as a hypernym the concept female and in the remaining case the concept person is present. This makes the SHE score of the DE [the girl] very high and therefore the match between the anaphor she (which sets SHE to 1 and HE and IT to 0) and this DE is high also. Note that this matching rule uses Wordnet to compute a kind of average natural gender out of all senses of the target word, without trying to disambiguate its senses in the context. In the case of natural gender, an integration of all senses of a word in the attempt to compute an average behaviour seems to work pretty well, but in other cases it proved to yield unacceptable matches. Woman against woman now also succeeds on the combined ground of equal lemmas and same natural gender.

Initially, Model 2 had another variant in which the synonymy and hypernymy relations were used in order to identify co-reference based on synonymy and on the ontology given by Wordnet concepts. The experiments proved however that, without a sense disambiguation knowledge source, there is little chance that coherent resolutions of the kind hope – belief be found without adding many other accidental equivalences as well, as here:

DE59 ->RE965: the impression of being more dangerous than most | RE977: The idea | RE1208: his mind | RE1002: only a dim idea of its nature | RE1021: manner | RE1219: O'Brien's urbane manner | RE1053: a secretly held belief | RE1220: a belief | RE1055: a hope | RE1223: intelligence | RE1079: this guess | RE1146: death |

Model 3: In our small corpus there have been found only two cases that took advantage of the Centering heuristic:

Winston had seen O'Brien perhaps a dozen times in almost as many years. He felt deeply drawn to him, …

In this excerpt there is no other clue that he refers to [Winston] and not [O'Brien] but the one given by Centering. Indeed if he would be [Winston] then the transition between the two adjacent units would be CONTINUE, while if he would be [O'Brien] the transition would be RETAINING, and the theory claims that CONTINUE is smoother than RETAINING and, therefore, preferable to it. The CT-preference heuristic successfully links he to [Winston]. Then the C-command heuristic prevents him from referring to the same entity as he does (a direct object is not allowed to refer to a subject) and this restricts him to refer to [O'Brien].

Still, in the example:

It was a gesture which, if anyone had still thought in such terms, might have recalled an eighteenth-century nobleman offering his snuffbox

his correctly refers to [an eighteenth-century nobleman] because the C-command heuristic does not apply. Two new cases were resolved by model 3 with respect to model 2.

Model 4: no long distance references were contained in our small corpus and therefore the application of the VT didn't bring any difference between models 3 and 4.

The following cases are still unresolved: O'Brien against a man called O'Brien (lack of insight into the constituents of a noun phrase), one of them against a girl as in one of them was a girl (no rule to treat subject – predicative noun relations in nominal predicate constructions and no implemented rule to treat element-to-set relationships), O'Brien against a large, burly man as in O'Brien was a large, burly man (same reason).

Difficulties in obtaining a corpus tagged for co-references prevented us from testing the models on a large corpus. We shall pursue this line in further research, trying also to build and test models for different languages. The AR-Engine that the framework incorporates allows for an easy integration of features that are reported in other approaches. In addition, although faithful to an incremental type of processing, the engine allows postponement of resolution until relevant information is acquired. It is also able to accumulate values for features of the discourse entities as the text unfolds.

Acknowledgments

The authors would like to thank Constantin Orăsan for providing us with the FDG analysis and NP extraction and Vlad Ciubotariu for the program that helped to perform the manual co-reference annotation.

6. References

Azzam, S., Humphreys, K., and Gaizauskas, R. 1998. Evaluating a Focus-Based Approach to Anaphora Resolution. Proceedings of the 17th Coling and the 36th Annual Meeting of the ACL (COLING-ACL'98). Montreal, Canada.
Barlow, M. 1998. Features Mismatches and Anaphora Resolution. In Botley and T. McEnery (eds.) New Approaches to Discourse Anaphora. S Technical Papers Vol. 11.
Brennan, S. E., Friedman, M. E., and Pollard, C. J. 1987. A Centering Approach to Pronouns. Proceedings of the 25th Annual Meeting of the ACL, Stanford.
Chomsky, N. 1981. Lectures on Government and Binding. Dordrecht, the Netherlands: Foris Publishers.
Cristea, D., Ide, N., and Romary, L. 1998. Veins Theory: A Model of Global Discourse Cohesion and Coherence. Proceedings of the 17th Coling and the 36th Annual Meeting of the ACL (COLING-ACL'98). Montreal, Canada.
Cristea, D. and Dima, G. E. 2000. Anaphora and Cataphora: what's in there, to appear in Proceedings of the 5th TELRI Seminar, Ljubljana - Oct. 2000.
Cristea, D. and Dima, G. E. 2001. An Integrating Framework for Anaphora Resolution. Information Science and Technology, Romanian Academy Publishing House, Bucharest, vol. 4, no. 3.
Gordon, P. C. and Hendrick, R. 1997. Intuitive knowledge of linguistic coreference. Cognition, 62.
Gordon, P. C. and Hendrick, R. 1998. The Representation and Processing of Coreference in Discourse. Cognitive Science, 22.
Grosz, B. J., Joshi, A. K., and Weinstein, S. 1995. Centering: a Framework for Modelling the Local Coherence of Discourse. Computational Linguistics, 21(2).
Hobbs, J. R. 1978. Resolving pronoun references. Lingua, 44. Also in B. Grosz, K. Sparck-Jones and B. Webber (eds.), Readings in Natural Language Processing, Morgan Kaufmann, Los Altos, 1986.
Keenan, E. and Comrie, B.
1977 Noun phrase accessibility and universal grammar Linguistic Inquiry, 8 Kennedy, C and Boguraev, B 1996 Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser Proceedings of the 16th International Conference on Computational Linguistics, vol 1 Lappin, Y , Shalom, and Leass, Herbert J 1994 An Algorithm for Pronominal Anaphora Resolution Computational Linguistics, vol 20, n 4 Mitkov, R 1997 Factors in Anaphora Resolution: They Are not the Only Things that Matter A Case Study Based on Two Different Approaches In R Mitkov and B Boguraev (eds ) Proceedings of the Workshop "Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts", Universidad Nacional de Educación a Distancia, Madrid Mitkov, R, 2002 Anaphor resolution Longman Studies in Language and Linguistics Poesio, M , Vieira, R , and Teufel, S 1997 Resolving bridging references in unrestricted texts In R Mitkov and B Boguraev (eds ) Proceedings of the Workshop "Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts", Universidad Nacional de Educación a Distancia, Madrid Reinhart, T 1981 Definite NP anaphora and c-command domains Linguistic Inquiry, 12 Sidner, C 1981 Focusing for interpretation of pronouns American Journal of Computational Linguistics, 7 Tufis, D 2000 Using a Large Set of EAGLES-compliant Morpho-Syntactic Descriptors as a Tagset for Probabilistic Tagging Proceedings of LREC 2000, Athens Vonk, W 1985 The immediacy of inferences in the understanding of pronouns In G Rickheit and H Stronher (eds ), Inferences in text processing 
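
Note on the natural-gender matching rule. The averaging of natural gender over all WordNet senses, discussed for Model 2 in Section 5, can be illustrated with a minimal sketch. It is not the AR-Engine implementation: the function names, the per-sense values (in particular the even HE/SHE split for a sense that only reaches the concept person) and the dot-product match are assumptions made for illustration, and the hypernym sets are hand-written stand-ins for a real WordNet lookup. Only the 5-out-of-6 distribution for girl and the anaphor vector for she come from the text.

```python
from typing import Dict, List, Set

GENDERS = ("HE", "SHE", "IT")


def sense_vector(hypernyms: Set[str]) -> Dict[str, float]:
    """Gender evidence contributed by one sense (values are assumptions)."""
    if "female" in hypernyms:
        return {"HE": 0.0, "SHE": 1.0, "IT": 0.0}
    if "male" in hypernyms:
        return {"HE": 1.0, "SHE": 0.0, "IT": 0.0}
    if "person" in hypernyms:
        # A person of unknown gender: split evenly (assumed, not from the paper).
        return {"HE": 0.5, "SHE": 0.5, "IT": 0.0}
    return {"HE": 0.0, "SHE": 0.0, "IT": 1.0}


def average_gender(senses: List[Set[str]]) -> Dict[str, float]:
    """Average the per-sense vectors over all senses, with no disambiguation."""
    if not senses:
        raise ValueError("at least one sense is required")
    totals = {g: 0.0 for g in GENDERS}
    for hypernyms in senses:
        for g, value in sense_vector(hypernyms).items():
            totals[g] += value
    return {g: totals[g] / len(senses) for g in GENDERS}


def gender_match(anaphor: Dict[str, float], candidate: Dict[str, float]) -> float:
    """Dot product of the two gender vectors (an assumed scoring function)."""
    return sum(anaphor[g] * candidate[g] for g in GENDERS)


# Toy data: 5 of the 6 senses of "girl" reach "female", the remaining one only "person".
GIRL_SENSES = [{"female", "person"} for _ in range(5)] + [{"person"}]
SHE = {"HE": 0.0, "SHE": 1.0, "IT": 0.0}  # the anaphor "she" sets SHE to 1, HE and IT to 0

girl = average_gender(GIRL_SENSES)
score = gender_match(SHE, girl)
print(round(score, 2))  # 0.92
```

Under these assumptions the DE [the girl] receives a SHE score of 5.5/6, so the match with she is high, whereas a word whose senses are spread across unrelated concepts would receive a diluted score. The same averaging explains why the approach degrades when the property being averaged depends strongly on the intended sense.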